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Abstract 

We consider the following statistical problem: based on an i.i.d. sample 
of size n of integer valued random variables with common law /i, is it 
possible to test whether or not the support of is finite as n goes to 
infinity? This question is in particular connected to a simple case of 
Tsirelson's equation, for which it is natural to distinguish between two 
main configurations, the first one leading only to laws with finite support, 
and the second one including laws with infinite support. We show that 
it is in fact not possible to discriminate between the two situations, even 
using a very weak notion of statistical test. 

1 Introduction 

In this paper, we consider an i.i.d. sample of integer valued random variables 
with common law /i and address the following problem: is it possible to build 
an asymptotic test for the finiteness (or for the non finiteness) of the support 
of fi as the sample size goes to infinity? This question is in fact motivated by 
Tsirelson's equation in discrete time, see [TJ [5]. Let {vk)ke-n be a sequence 
of probability laws on the torus T = R/Z. A process {'qk)k^-n taking value 
in T is a weak solution of Tsirelson's equation associated to {i'k)k&-n if for all 
k G — N, the random variable ^fc — rjk — r/k-i has law Vk and is independent of 
= <j{r]n, n < k — 1). If moreover for all k G — N, J"^ = ^k' ^'^^^ process 
{r]k)kG-N is said to be a strong solution. It is proved by Yor in [5] that there is 
always existence of a weak solution to the equation: if U and (k, k & — N, are 



independent random variables taking values in T such that U is uniform and, 
for all fc, Cfc has distribution Vk, then the process (77^)/tg_N defined by 

Vo^U, V-n^U^Co C-n+1 foralln>l, (1) 

is a weak solution of Tsirelson's equation associated to {h'k)ke-N- 

In this work we consider only the simple case where Vk — for all k. It can 
be shown that in terms of existence and unicity of strong or weak solutions to 
the equation, depending on the law v, three very different situations have to be 
considered, see for details: 

- Case 1: If is a Dirac measure: existence of a strong solution, no unique- 
ness in law. 

- Case 2: If the support of contains at least two points and there exists 
an integer p > 2 and x G T such that it is included in x + {k/p, k = 
0, . . . ,p — 1}: no strong solution, no uniqueness in law. 

- Case 3: If is a law which does not satisfy the conditions of Case 1 or 
Case 2: no strong solution, uniqueness in law. 

In [4l[5], Tsirelson's equation is seen as a cosmological caricature: rjk-i repre- 
sents the state of the universe at time fc — 1 and the action of the evolution 
process at time fc. In particular, the authors raise the issue of the "initial 
magma" that is the description of the cr-algebra 

71>1 

It turns out that it is necessarily a trivial u-algebra in Case 3. 

Our goal here is to give a statistical look at this cosmological caricature. More 
precisely, based on the historical observations of the state of the universe 
rio,ri-i,... ,r/-n, can we (at least partially) decide whether we are in Case 1, 
Case 2 or Case 3 ? Of course we can already remark that under Case 1, the ^fe 
are constant and under Case 2 the Pk = S.k — Cfe-i ^.re all rational. Therefore, if 
we observe two different values for the in the sample, we can conclude that 
we are not in Case 1 and if we observe one irrational pk , we can discard Case 
2. Consequently, if there are at least two different ^fc and one irrational pk, we 
can conclude that we are in Case 3. However, in practice, we have only access 
to rounded values so it does not really make sense to build a procedure based 
on the fact that some quantity is irrational or not. 

In order to rigorously answer the question, we use the notion of statistical test. 
We first consider an auxiliary problem, which is interesting on its own, namely 
the possibility of building a consistent test for the finiteness (or for the non 
finiteness) of the support of an integer valued distribution /i. based on an i.i.d. 
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sample {Xi, . . . , X„) with law fi. We prove that designing such a test procedure 
is in fact impossible, even using a very weak notion of consistent test. Then we 
show that a corollary of this result is the impossibility to build a consistent test 
in order to separate on the one hand the union of Case 1 and Case 2 and on the 
other hand Case 3. 

The paper is organized as follows. The framework we use in order to test for the 
finiteness (or for the non finiteness) of the support of a distribution based on 
i.i.d. data is detailed in Section [5J In Section [31 we suggest an a priori natural 
functional in order to build such a test. The impossibility of building a consistent 
test is stated in Section|3]together with the connection with Tsirelson's equation. 
The proofs are relegated to Section [5] 

2 Testing framework 

In this section we consider an i.i.d. sample (^i, . . . ,Xn) of integer valued ran- 
dom variables with common law fi. Our goal is to investigate the possibility 
of testing for the finiteness (or for the non finiteness) of the support of ^ as 
n goes to infinity. We refer to [3] for a detailed presentation of the notion of 
asymptotic statistical test. In the following, we write for the probability 
measure on the canonical space under which the canonical process (Xi)i>i 
is a family of i.i.d. random variables with law fi. Moreover, we denote by Vf 
the set of laws on N with finite support and by Voo the set of laws on N with 
infinite support. To fix idea, let us consider in this section the null hypothesis 
of the finiteness of the support against the alternative of the non finiteness of 
it (the following definitions can be adapted in an obvious way if the null and 
alternative hypotheses are switched). 



2.1 Uniform test 

In order to discriminate between the two situations, the first idea is to try to 
build a consistent test with asymptotic uniform level a, with a given in [0, 1). 
This means designing a rejection area Wn in N" such that 

lim sup sup [{Xi ,...,X„) eWn] <a (2) 

and for all £ Voo, 

limP,,[(Xi,...,X„) e VK„] =1. 

n 

This is clearly impossible. Indeed, for fixed n, any law /if" with fii e Vj can 
be approached with arbitrary accuracy (in total variation norm for example) by 
a law /if" with ^2 e 'Poo- 
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2.2 Pointwise test 



We now consider a much weaker notion of test: we replace ([5]) by 
sup lim sup [{Xi , . . . , X„) e Wn] < a. 

The statistical meaning of this notion of test is of course very arguable but our 
point of view is the following; we want to show that it is impossible to build a 
consistent test with given level a even in this very weak setting. 

Remark 1. If we do not consider all distributions with finite support but only 
distributions with support in [0, A^] for some known N, a test statistics can of 
course be easily designed. Indeed, it is then enough to consider the statistics 
max(Xi, . . . , Xn) A {N + 1): when Xi follows /ii G Vf it converges to the right 
endpoint of /ii which is smaller than A^, when Xi follows /i2 G Voo it converges 
to TV + 1. 



3 Candidate test statistics for pointwise test 

We want to investigate potential test statistics for discriminating between our 
two situations. Since the laws in Vf and Voo are different because they have 
different support, a natural idea is to design a test statistics based on the em- 
pirical support. For example, we can split the sample of size 2n, into two parts 
and compare the maximum over each subsample, that is we consider 

Sn = max Xi, Sn = max Xi, and = 1 r r, _c^■ 

l<i<n Ti+l<i<2n — ^nf 

If /i has finite support, it is clear that for n large enough, the maxima of the 
two subsamples coincide and in particular converges in probability to 1. In 
the infinite support case, one may expect an opposite behavior. In fact, we have 
the following result whose proof is given in Section [5] 

Proposition 1. If ii has infinite support, then T2n does not converge in P^- 
probability to 1. 

Therefore for any fii with finite support and fi2 with infinite support the statis- 
tics shows different asymptotic behaviors under fii and /i2. However, this is 
not enough to build a consistent test since we need to investigate the asymptotic 
probability that T2„ is different from 1. In fact, the first part of Theorem [T] in 
the next section implies that there exist a distribution fi2 with infinite support 
and a subsequence of T2n which converges to 1 in probability under P^^ . 

4 Impossibility of testing 

The following theorem shows that it is not possible to build a test for the finite- 
ness (or for the non finiteness) of the support. Indeed, it states that if one finds 
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a set in which any i.i.d. sample of size n from a law with finite support /ii is 
very unlikely to be, there is also one distribution fi2 with infinite support so 
that an i.i.d. sample of it is very unlikely to be in this set, and conversely. 

We write for the expectation with respect to P^. We have the following 
result. 

Theorem 1. Let a G [0, 1), let (/j : N* — > N* be an increasing map and let 
An : N"^'"^ — > [0, 1], n > 1, be a sequence of measurable functions. 

1. Assume that for any distribution fii on N with finite support, 

limsup [An{Xi, . . .,X^(n))] < a- 

n 

Then, there exists a distribution /i2 on N with infinite support such that 
liminf E^, U„(Xi, . . . ,X^fn))] < a. 

n 

2. Assume that for any distribution fi2 on N with infinite support, 

limsup E^ J A„(Xi, ... ,X^(„))] <a. 

n 

Then, for all a' € {a, 1) there exists a distribution jii on N with finite 
support such that 

limmf E^, [An{Xi, X^(„))] < a'. 

Remark 2. In Theorem [U we do not only consider the usual testing framework 
where the An are indicators of a set and >f{n) = n. Indeed, we allow for ran- 
domized test procedures and subsequences in the sample size. This is slightly 
more general and will be useful for the proof of Corollary [TJ 

We now come back to Tsirelson's equation and state the result showing the 
impossibility of testing the hypothese that v belongs to the union of Case 1 and 
Case 2 against the one that v belongs to Case 3, and conversely. Denote by P* 
the law on of the "uniform solution" (77^)fcg_N, given by ^ (when Vk = ^ 
for all k) and by {rjk)ke-n the canonical process on . We have the following 
corollary. 

Corollary 1. Let a G [0, 1), /ei : N — t- N &e an increasing map and let 
Bn C T''''^"^^"'^ be a sequence of measurable sets. 

1. Assume that for any distribution vi on T belonging to Case 1 or Case 2 
of Section [7J 

limsup P*^ [{rjo, . . .,V-<p(n)) e S„] < a. 

n 

Then, there exists a distribution 1/2 belonging to Case 3 such that 
liminf P:J(77o,-.- J V-ip{n)) G Bn\ < a. 
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2. Assume that for any distribution V2 on T belonging to Case 3 of Section 

m 

limsup P*^ [(?7o, . . ■ ,?7-<p(„)) ^ -B„] < a. 

n 

Then, for all a' € (a, 1) there exists a distribution vi belonging to Case 1 
or Case 2 of Section]^ such that 

liminf P*^ [(770, . • . ,r?_^(„)) G S„] < a. 



5 Proofs 

5.1 Proof of Proposition [1] 

Let /X be a probability measure on N and assume that P^ [T2„ = 1] — > 1. Since 
Sn and Sn are independent with the same law, it implies that there exists a 
sequence of integers fc„ such that P;_j[S'„ — fc„] 1. 

Let us show that kn is a bounded sequence, which will imply that the support 
of fj, is finite. If kn is not bounded, then there exist a subsequence such that 
ki+m > kng. for all i and km, 00. Therefore 

P/i[>5'n£ — kni\ >■ 1 

and 

P^[5i+„, =/c„J ^0 
because P[Si+ni = fci+jif) 1 and fci+n^ 7^ kn^- Moreover, we have that 

P/i = kn^ P/i['5'n^ = ^n^] 

is equal to 

This absolute value of this last quantity is smaller than +00)) which goes 

to zero as n goes to infinity. This shows the contradiction. 



5.2 Proof of Theorem [T] 

5.2.1 Proof of Part 1 in Theorem [1] 

We recursively define a sequence of distributions /i", n > 1, and a sequence of 
integers n > 0. 

— At rank n = 1, we consider ^\ the Dirac measure at point 0, 1/1(0) = 1 and 
^(1) = 1- 

— At rank n > 1, we define /ij and The law /ij is the distribution with 
discrete support {0, . . . , n — 1} defined by 
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with c„ such that 

n-l 

V - = 1. 

The integer ^l){n) is taken so that 

V'(n) > max(^(n — 1), n?) 

and for all m>il){n), 

Note that finding such a ?/'("•) is always possible thanks to the assumption on 
the sequence {Am)m- In particular, the sequence il^{n) is increasing and satisfies 



(i£> o ih(' ^^"^ < 00 



+ 00 

E(<^oV'(A:))2 

Now define the distribution ij.2 on N with infinite support by 



((/?o V(A;))2 



with c such that 

We have that 
is smaller than 



+ 00 



V 

^ ((no 



k=0 



1. 



E^jA^(„)(Xi,...,X^o^(„)) I fl {Xi<n-l}]+¥^,[ [j {Xi>n}] 

i=l j=l 

!po'ip(n) 

= E^j [^^(„)(Xi, . . . , X^ov,(n))] + [ U ^ '^^ 

i=l 

+ 00 



1 1 

< a H 1- o ■(/)(n) -. , , „ . 



Using the fact that tp{n) is increasing and the inequality tlj{n) > n^, we obtain 

+00 -j^ +00 ^ ^ +00 ^ 

a-\ \-ipo ipln) , ,, < a H h -77^ < a H 1" T?- 

k — Th k — 71 k — Ti 

This quantity goes to a as n goes to infinity, which gives the result. 



5.2.2 Proof of Part 2 in Theorem [T] 

Let a' € (a, 1). Assume that for any distribution fii on N with finite support 

hmmf [An{Xi, ^^(„))] > a'. 
This last inequaUty is equivalent to 

limsup [l - An{Xi, . . . ,X^(„))] < 1 ~ a . 

n 

According to point 1. of Theorem [1] there exists a distribution /i2 on N with 
infinite support such that 

limjnf E^, [1 -^An{Xi,..., X^(„))] < I - a , 

that is 

limsup E^2 [An{Xi, . . . ,X^(„))] > a' 

n 

wich gives the contradiction since a' > a. 
5.3 Proof of Corollary [T] 

Let / be an injection from N into T n Q/Z. If /i is a probability measure on N, 
denote by f{fi) = ^ o the image of /i by /. On the one hand, one has: 

- if /i has finite support, /(/i) belongs to Case 1 or Case 2. 

- ii fi has infinite support, /(/i) belongs to Case 3 since /(/i) has infinite support. 
On the other hand, because of the definition of P*, one has 

P}(^) [im, • ■ • , V~v{n}) e Bn] = E^ [An{Xi, X^(„))] 

where ^„(Xi, . . . , X^p(^n)) is equal to 

/ 1b„ {u, u-f{Xi),u-f{X,)-f{X2), u~f{X,)-f{X2) du. 

Jo 

Using these two facts, together with Theorem [TJ the proof of Corollary [1] is 
easily completed. 

Acknowledgments 

We are very grateful to Kouji Yano and Marc Yor for introducing us to the 
statistical question linked to Tsirelson's equation. 



8 



References 

[1] Cirel'son, B.S., An example of a stochastic differential equation that has 
no strong solution, Teor. Verojatnost. i Primenen., 20(2):427-430, 1975. 

[2] TSIRELSON, B.S., My drift, Citing works., 

http://www.inath.tau. ac.il/^tsirel/Research/mydrift/citing. html 

[3] Van DER Vaart, A., Asymptotic Statistics, Cambridge University Press, 
1998. 

[4] Yano, K. and Yor, M., Around Tsirelson's equation, or: The evolution 
process may not explain everything. Preprint, 2010. 

[5] Yano, K. and Yor, M., A cosmological caricature derived from Tsirelson 
type equations. Preprint, 2011. 

[6] Yor, M., Tsirel'son's equation in discrete time, Probabihty Theory and 
Related Fields, 91(2):135-152, 1992. 



9 



